A Revisit of Block Power Methods for Finite State Markov Chain Applications
نویسندگان
چکیده
According to the fundamental theory of Markov chains, under a simple connectedness condition, iteration of a Markov transition matrix P , on any random initial state probability vector, will converge to a unique stationary distribution, whose probability vector corresponds to the left eigenvector associated with the dominant eigenvalue λ1 = 1. The corresponding convergence speed is governed by the magnitude of the second dominant eigenvalue λ2 of P . Due to the simplicity to implement, this approach by using power iterations to approximate the dominant eigenpair, known as the power method, is often applied to estimate the stationary distributions of finite state Markov chains in many applications. In this paper, we revisit the generalized block power methods for approximating the eigenvector associated with λ1 = 1 of a Markov chain transition matrix. Our analysis of the block power method shows that when s linearly independent probability vectors are used as the initial block, the convergence of the block power method to the stationary distribution depends on the magnitude of the (s+1)th dominant eigenvalue λs+1 of P instead of that of λ2 in the power method. Therefore, the block power method with block size s is particularly effective for transition matrices where |λs+1| is well separated from λ1 = 1 but |λ2| is not. This approach is particularly useful when visiting the elements of a large transition matrix is the main computational bottleneck over matrix–vector multiplications, where the block power method can effectively reduce the total number of times to pass over the matrix. To further reduce the overall computational cost, we combine the block power method with a sliding window scheme, taking advantage of the subsequent vectors of the latest s iterations to assemble the block matrix. The sliding window scheme correlates vectors in the sliding window to quickly remove the influences from the eigenvalues whose magnitudes are smaller than |λs| to reduce the overall number of matrix–vector multiplications to reach convergence. Finally, we compare the effectiveness of these methods in a Markov chain model representing a stochastic luminal calcium release site.
منابع مشابه
Relative Entropy Rate between a Markov Chain and Its Corresponding Hidden Markov Chain
In this paper we study the relative entropy rate between a homogeneous Markov chain and a hidden Markov chain defined by observing the output of a discrete stochastic channel whose input is the finite state space homogeneous stationary Markov chain. For this purpose, we obtain the relative entropy between two finite subsequences of above mentioned chains with the help of the definition of...
متن کاملDiscrete-time Markovian Stochastic Petri Nets
We revisit and extend the original definition of discrete-time stochastic Petri nets, by allowing the firing times to have a “defective discrete phase distribution”. We show that this formalism still corresponds to an underlying discrete-time Markov chain. The structure of the state for this process describes both the marking of the Petri net and the phase of the firing time for of each transit...
متن کاملAN APPLICATION OF TRAJECTORIES AMBIGUITY IN TWO-STATE MARKOV CHAIN
In this paper, the ambiguity of nite state irreducible Markov chain trajectories is reminded and is obtained for two state Markov chain. I give an applicable example of this concept in President election
متن کاملANALYSIS OF FINITE BUFFER RENEWAL INPUT QUEUE WITH BALKING AND MARKOVIAN SERVICE PROCESS
This paper presents the analysis of a renewal input finite buffer queue wherein the customers can decide either to join the queue with a probability or balk. The service process is Markovian service process ($MSP$) governed by an underlying $m$-state Markov chain. Employing the supplementary variable and imbedded Markov chain techniques, the steady-state system length distributions at pre...
متن کاملFast Polarization for Processes with Memory
Fast polarization is crucial for the performance guarantees of polar codes. In the memoryless setting, the rate of polarization is known to be exponential in the square root of the block length. A complete characterization of the rate of polarization for models with memory has been missing. We consider polar codes for processes with memory that are characterized by an underlying ergodic finite ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1610.08881 شماره
صفحات -
تاریخ انتشار 2016